
fix: update gemini-live model to use realtime mode instead of chat #18009

Closed
Chesars wants to merge 3 commits into BerriAI:main from Chesars:fix/remove-gemini-live-model-from-catalog

Conversation


Chesars (Collaborator) commented Dec 15, 2025

Title

fix: update gemini-live model to use realtime mode instead of chat

Relevant issues

N/A

Pre-Submission checklist

  • I have added testing in the tests/litellm/ directory (adding at least 1 test is a hard requirement) - N/A (JSON config change only)
  • My PR passes all unit tests on make test-unit
  • My PR's scope is as isolated as possible, it only solves 1 specific problem

Type

🐛 Bug Fix

Summary

The gemini-live-2.5-flash-preview-native-audio-09-2025 model was incorrectly configured with mode: "chat" and REST API endpoints, but this model only works with WebSockets (Realtime API).

Changes

  • Change mode from "chat" to "realtime"
  • Update supported_endpoints from ["/v1/chat/completions", "/v1/completions"] to the correct realtime endpoints:
    • gemini/ prefix: /v1/realtime
    • vertex_ai/ prefix: /vertex_ai/live

Files changed:

  • model_prices_and_context_window.json
  • litellm/model_prices_and_context_window_backup.json
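
The resulting catalog entries would look roughly like this. This is a sketch only: the key names follow the model names and provider prefixes described above, and pricing/context-window fields present in the real JSON are omitted.

```json
{
  "gemini/gemini-live-2.5-flash-preview-native-audio-09-2025": {
    "mode": "realtime",
    "supported_endpoints": ["/v1/realtime"]
  },
  "vertex_ai/gemini-live-2.5-flash-preview-native-audio-09-2025": {
    "mode": "realtime",
    "supported_endpoints": ["/vertex_ai/live"]
  }
}
```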



ishaan-jaff (Member) left a comment


Please leave it on the catalog, we support it through pass through live API requests

I suggest fixing the supported endpoints to not include /chat, /completions

The gemini-live-2.5-flash-preview-native-audio-09-2025 model only works with WebSocket (Live API), not REST endpoints. Changed supported_endpoints from /v1/chat/completions to /vertex_ai/live to reflect the actual passthrough endpoint available in the LiteLLM proxy.

The gemini/ prefix indicates Google AI Studio, which uses the /v1/realtime endpoint (OpenAI-compatible), not /vertex_ai/live.

Chesars commented Dec 15, 2025

Please leave it on the catalog, we support it through pass through live API requests

I suggest fixing the supported endpoints to not include /chat, /completions

Thanks for the feedback! You're right; I've updated the PR to fix supported_endpoints:

  • gemini-live-* (vertex_ai) → /vertex_ai/live
  • gemini/gemini-live-* (gemini) → /v1/realtime

Chesars requested a review from ishaan-jaff December 15, 2025 23:16

krrishdholakia commented Dec 16, 2025

shouldn't you also fix the mode of the model? as it's not a chat model. not sure what mode we have for realtime models, but i assume it's just realtime?

The mode field is used by health checks to determine the correct
check method (WebSocket for realtime vs REST for chat).
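
As a hypothetical sketch (not LiteLLM's actual implementation), the dispatch described above could look like this, where `pick_health_check` and its return values are illustrative names only:

```python
# Hypothetical sketch, NOT LiteLLM's actual code: illustrates how a health
# check could branch on a catalog entry's "mode" field, as described above.

def pick_health_check(model_info: dict) -> str:
    """Return which probe style to use for a model's health check."""
    mode = model_info.get("mode", "chat")
    if mode == "realtime":
        # Realtime/Live models only speak WebSocket, so a REST probe
        # against /chat/completions would always fail.
        return "websocket"
    # chat, completion, etc. can be probed over plain HTTP.
    return "rest"

print(pick_health_check({"mode": "realtime"}))  # websocket
print(pick_health_check({"mode": "chat"}))      # rest
```

This is why leaving the model's mode as "chat" would make health checks hit a REST endpoint the model does not serve.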
Chesars changed the title from "fix: remove gemini-live model from catalog (WebSocket only)" to "fix: update gemini-live model to use realtime mode instead of chat" Jan 18, 2026

Chesars commented Jan 18, 2026

shouldn't you also fix the mode of the model? as it's not a chat model. not sure what mode we have for realtime models, but i assume it's just realtime?

Exactly, updated in 763b00a


Chesars commented Mar 4, 2026

Closing as superseded by #22814.

Chesars closed this Mar 4, 2026
Chesars deleted the fix/remove-gemini-live-model-from-catalog branch March 4, 2026 22:44